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Abstract 

Orcadian- and sleep disturbances may be central for understanding the pathophysiology and 
treatment of depression. The effect of melatonin on depression/depressive symptoms has been 
investigated previously. This systematic review assesses the current evidence of a therapeutic- 
and prophylactic effect of melatonin in adult patients against depression or depressive 
symptoms. A search was performed in The Cochrane Library, PubMed, EMBASE and PsyclNFO 
for published trials on November 14th 2013. Inclusion criteria were English language, RCTs or 
crossover trials. Our outcome was measurement of depression/depressive symptoms with a 
validated clinician-administered or self-rating questionnaire. PRISMA recommendations were 
followed and the Cochrane risk-of-bias tool used. Ten studies in 486 patients were included in 
the final qualitative synthesis and four studies, 148 patients, were included in two meta- 
analyses. Melatonin doses varied from 0.5-6 mg daily and the length of follow-up varied from 
2 weeks to 3.5 years. Three studies were done on patients without depression at inclusion, two 
studies in patients with depression and five studies included a mixture. Six studies showed an 
improvement in depression scores in both the melatonin and placebo groups but there was no 
significant difference. One study showed a significant prophylactic effect and another found 
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a significant treatment effect on depression with melatonin compared to placebo. The two 
meta-analyses did not show any significant effect of melatonin. No serious adverse events were 
reported. Although some studies were positive, there was no clear evidence of a therapeutic- or 
prophylactic effect of melatonin against depression or depressive symptoms. 
© 2014 Elsevier B.V. and ECNP. All rights reserved. 



1. Introduction 

Depression is a common mental disorder and WHO estimates 
that more than 350 million people of all ages suffer from 
depression worldwide (www.who.int - 2013). Depression is 
the leading cause of disability worldwide, and is a major 
contributor to the global burden of disease (www.who.int- 
2013). The antidepressants most commonly used today are 
thought to exert their actions based upon the classical 
mono-amine deficiency hypothesis (Belmaker and Agam, 
2008). New approaches are in focus because the clinical 
effects of antidepressants used today can take up to two or 
more weeks to emerge, early discontinuaton rates are high, 
full remission rates are low and side effects are common 
(Malhi et al., 2013; Papakostas, 2008, 2010). 

Orcadian rhythms have been shown to be fundamentally 
disturbed in patients with depression (Germain and Kupfer, 

2008) and insomnia and early-morning awakening are pro- 
minent symptoms of clinical depression (Germain and 
Kupfer, 2008; Srinivasan et al., 2009), suggesting a close 
link between the melatonergic system and depressed mood 
(Malhi and Kuiper, 2013; Pandi-Perumal et al., 2009; 
Srinivasan et al., 2009). Addressing the chronobiology of 
mood disorders with novel melatonin-based therapies, such 
as agomelatine has shown significant antidepressant effects 
with a favorable adverse effect and safety profile (Hickie 
and Rogers, 2011; Koesters et al., 2013). However, it is not 
known whether the antidepressant action of agomelatine is 
a synergistic effect or independently linked to the MT1 /MT2 
receptor binding properties or to the affinity to the 5-HT 2c 
serotonin receptor (de Bodinat et al., 2010). 

Major depressive disorder (MDD) was proposed as the 
"low melatonin syndrome" in 1979 (Wetterberg, 1979); a 
concept that focuses on low melatonin secretion as a 
biological marker of depression. As a consequence of this 
theory many studies investigating the secretion of melato- 
nin in depression have followed (Srinivasan et al., 2009), 
although the results are ambiguous with findings of 
decreased and increased secretion (Srinivasan et al., 

2009) . Melatonin itself is known to promote sleep, and 
synchronization and entrainment of the circadian clock; all 
effects that could contribute to the possible antidepressant 
effect (Boyce and Hopwood, 2013). In addition, exogenous 
melatonin has shown anti-depressant-like effects in animal 
models (Binfare et al., 2010; Detanico et al., 2009; 
Raghavendra et al., 2000). Since the first clinical study 
investigating the antidepressant effect of exogenous mela- 
tonin in humans in 1976 (Carman et al., 1976) showing 
negative effects, several other studies have been conducted 
with both positive and negative findings and therefore no 
clear evidence of the effect of melatonin on mood has been 
concluded (Bellipanni et al., 2001; Dalton et al., 2000; 



Danilenko and Putilov, 2005; Del et al., 2013; Fava et al., 
2012; Jean-Louis et al., 1998; Lewy et al., 1998). 

The aim of this systematic review and meta-analysis was 
to assess the current evidence of a therapeutic- and 
prophylactic effect of melatonin in adult patients against 
depression or depressive symptoms. 

2. Experimental procedures 

This systematic review was conducted following The Preferred Report- 
ing Items for Systematic reviews and Meta-Analyses (PRISAAA) (Liberati 
et al., 2009), except that no formal review protocol was published. 

2.1. Eligibility criteria 

We used the following PICO (P: population, I: intervention, C: 
comparator/control, O: outcomes) (Liberati et al., 2009) when 
constructing the eligibility criteria and the search strategy; P: humans, 
adults +18 years, I: melatonin, C: to placebo or other active 
treatment and O: depression, depressive symptoms, mood disorders. 

English language, randomized controlled trials (RCT) (with an 
inactive placebo or active treatment control group) or randomized 
crossover trials assessing the treatment or prophylactic effect of 
melatonin for depression or depressive symptoms were included. 
Our outcome was the measurement of depression, regardless of 
whether this was assessed as a primary, secondary or tertiary 
outcome in the studies. The outcome measure had to be either a 
validated clinician-administered or self-rating questionnaire with 
known psychometric properties and with a main focus on diagnosing 
and/or measuring depression. Results had to be reported in the 
manuscript in mean (SD) or median (IQR/range). The patient group 
had to have a depression or be predisposed in some way to the 
development of a depression or depressive symptoms (i.e. patients 
with chronic illnesses, cancer or sleep disturbances) and therefore 
studies on healthy volunteers were not included. In addition, 
inclusion was not restricted to a specific dose of melatonin or route 
or timing of administration, nor restricted to the primary and 
secondary outcomes otherwise being assessed in the studies. 

Exclusion criteria were: other study designs than RCT or crossover, 
studies without results in figures or numbers, melatonin not given as 
an intervention, depression or depressive symptoms not being mea- 
sured, language not English, an unclear (too general or not specific for 
depression) measurement instrument used (i.e. VAS for mood or MOS- 
SF-36) or if the effect of melatonin could not be isolated from other 
interventions (trials with co-intervention bias i.e. melatonin given 
together with buspirone, sleep deprivation, or light therapy). 

2.2. Information sources and the search 

The Cochrane Library, PubMed, EMBASE and PsyclNFO were 
searched for published trials on November 14th 2013 with the 
following search strategy: 

((melatonin) AND (depression OR depressive disorders OR mood 
disorders OR depressive symptoms) AND (therapeutics OR treat* OR 
effect*)). 
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When constructing the search terms, words from the PICO were 
used in MESH term searches to find words to expand and improve 
the search strategy. Truncation was used to cover different word 
endings. Limits used when searching the databases were as follows: 

PubMed: humans, 19+ years, randomized controlled trial, 
clinical trial and controlled clinical trial 
The Cochrane Library: no limits 

EMBASE: humans, adults <18 to 64 years> or aged >65+ 
years, all clinical trials 

PsyclNFO: humans, adults 18+years, treatment outcome/clin- 
ical trial. 



No limits were set for language, so all records identified were 
manually screened with regard to being English. No limits were set 
with regard to the years of publication. The reference list of all 
included studies was manually reviewed to identify additional relevant 
studies. No search was performed for ongoing trials or unpublished 
studies and no contact with the study authors was made. 

2.3. Study selection 

All records identified by the database searches were manually 
screened for duplicates. Based on a screening of all the titles/ 
abstracts, studies that were clearly not relevant were excluded. 
Full-text articles of the remaining studies were assessed for 
eligibility by two independent authors (MVH, AKD). Disagreements 
on inclusion were resolved by discussion between the two authors. 



2.4. Data collection 

Information about participant characteristics, intervention details, 
outcome measures and adverse effects was extracted indepen- 
dently by two authors (MVH, AKD) using a predesigned data form. 
Studies including patients with a depression at inclusion were 
classified as "treatment" and studies including patients without a 
depression at the time of inclusion as "prophylactic". Outcome data 
were extracted from the text, tables and figures. For the studies 
included in the meta-analyses, data needed to be presented as 
mean (SD), otherwise data were converted to this format 
(SD=SEMx yVi). Furthermore, geometrical mean was assumed as 
mean. When data were presented as changes in a score, the actual 
score was calculated by adding/subtracting the changes to/from 
the baseline value for the two groups. If the time of measurement 
was not exactly the same in the different studies, the closest 
possible time was chosen. If a cross-over trial was included in a 
meta-analysis all intervention measurements from the 2 groups 
were analyzed as if the trial were a parallel group trial, as 
suggested by The Cochrane Handbook for Systematic Review of 
Interventions (Higgins et al., 2011). Data on adverse events were 
only recorded for the melatonin group alone. 

2.5. Data synthesis 

We performed the data synthesis and statistical analysis using 
Review Manager software (Review Manager (RevMan) [Computer 
program], Version 5.2. Copenhagen: The Nordic Cochrane Centre, 
The Cochrane Collaboration, 2012.). Since the population was 
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Figure 1 PRISMA flow diagram. 
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varied, we included all types of age groups, dosing regimens, 
diseases and study sizes. Due to this variation, a random-effects 
model was suitable for the meta-analysis. Separate meta-analyses 
were performed if the following three criteria were met: the 
studies used the same rating scale for depression/depressive 
symptoms, the studies were either therapeutic or prophylactic 
and it was possible to extract data needed for the meta-analyses 
directly from the text, tables or figures. Accordingly, we were able 
to perform two meta-analyses: melatonin vs. placebo using Beck 
Depression Inventory, and Hospital Anxiety and Depression Scale. 
We analyzed data from the questionnaires as continuous data using 
an inverse variance method and presented these as mean differ- 
ences when outcome measures were on the same scale. We 
expressed the overall results for our primary outcome as effect 
size with 95% confidence intervals. We assessed statistical hetero- 
geneity with the I 2 statistic, thereby estimating the percentage of 
total variance across studies that was due to heterogeneity rather 
than chance. 

We were not able to assess publication bias or small study effects 
in a qualitative manner using a funnel plot, as there were less than 
10 studies included in the meta-analyses. 

2.6. Risk of bias 

The two authors (MVH, AKD) also assessed the methodological 
quality of the included studies by using the Cochrane risk-of-bias 
tool, consisting of 6 items (Higgins et al., 201 1 ). Two items assessed 
the strength of the randomization process in preventing selection 
bias in the assignment of participants to the interventions; 
adequacy of sequence generation and allocation concealment. 
The third item assessed the influence of performance bias (blinding 
of participants and personnel) and the fourth item assessed 
detection bias (blinding of outcome assessors). The fifth item 
assessed attrition bias and whether incomplete outcome data were 
adequately addressed. The sixth item assessed whether selective 
reporting exists (reporting bias) and this item requires a comparison 
of published data with a trial protocol. Trial protocols for all 
included studies were searched for in two databases: www.clinical 
trials.gov and WHO'S Trial Search (http://apps.who.int/trialsearch/). 
Randomized cross-over trials were separately assessed for risk of bias 
with regard to whether statistical consideration was given to the 
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carry-over and/or period-effect (Hills and Armitage, 1979) of this 
study design. 

3. Results 

The database searches identified 372 records. After removal 
of duplicates, 304 titles/ abstracts were screened, and 
60 full-text articles were subsequently assessed for elig- 
ibility. Of these full-text articles, 50 were excluded due to 
various reasons (Figure 1) and 10 studies (Chojnacki et al., 
2011; Garzon et al., 2009; Hurtuket al., 2011; Kripke et al., 
2006; Rahman et al., 2010a; Riemersma-van der Lek et al., 
2008; Serfaty et al., 2010; Song et al., 2005; Lu et al., 2005; 
Williams et al., 2002) were included in the final qualitative 
synthesis, of which it was possible to include 4 studies 
(Chojnacki et al., 2011; Lu et al., 2005; Serfaty et al., 2010; 
Song et al., 2005) in 2 meta-analyses (Figures 2 and 3). 
Table 1 summarizes participant characteristics, study design 
and intervention details and Table 2 summarizes outcome 
measures and adverse effects of the included studies. 

Of the 10 studies included, 5 were randomized placebo- 
controlled trials (Chojnacki et al., 2011; Kripke et al., 2006; 
Riemersma-van der Lek et al., 2008; Serfaty et al., 2010; 
Song et al., 2005) and 5 were randomized crossover trials 
(Garzon et al., 2009; Hurtuk et al., 2011; Lu et al., 2005; 
Rahman et al., 2010a; Williams et al., 2002). The number of 
included patients ranged from 17-189, melatonin doses 
varied from 0.5 mg to 6 mg daily and the length of follow- 
up varied from 2 weeks to 3.5 years. Three studies were 
done on patients without depression at the time of inclusion 
(prophylactic) (Garzon et al., 2009; Lu et al., 2005; Song et al., 
2005), 5 studies included a mix of patients with/without 
depression at inclusion (Kripke et al., 2006; Hurtuk et al., 
2011; Rahman et al., 2010a; Riemersma-van der Lek et al., 
2008; Williams et al., 2002) and two studies (therapeutic) 
included only patients with depression (mild/moderate depres- 
sion on Beck Depression Inventory Diagnostic and Statistical 
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Figure 2 Forest plot - HAD. HAD: The Hospital Anxiety and Depression Scale. 
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Figure 3 Forest plot - BDI. BDI: Beck Depression Inventory. 
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Table 2 Outcome measures and adverse effects. 
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CES-D: The Center for Epidemiological Studies Depression Scale, QJDS-SR: Quick Inventory of Depressive Symptomatology self-rating version, BDI: Beck Depression Inventory, HDRS (17 and 


21 -item): Hamilton Depression Rating Scale, GDS: Yesavage Geriatric Depression Scale, CSDD: Cornell Scale for Depression in Dementia, HAD: The Hospital Anxiety and Depression Scale, 
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a No significant difference compared to placebo. 
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Manual of Mental Disorders (4th edition) diagnosis of MDD, 
respectively) (Chojnacki et al., 2011; Serfaty et al., 2010). 
Eight different scales were used to measure depression; The 
Center for Epidemiological Studies Depression Scale (CES-D), 
Quick Inventory of Depressive Symptomatology self-rating ver- 
sion (QIDS-SR), Beck Depression Inventory (BDI), Hamilton 
Depression Rating Scale (HDRS 17 and 21 -item), Yesavage 
Geriatric Depression Scale (GDS), Cornell Scale for Depression 
in Dementia (CSDD) and The Hospital Anxiety and Depression 
Scale (HAD). Seven studies used only self-rating scales 
(Chojnacki et al., 2011; Garzon et al., 2009; Hurtuk et al., 
2011; Kripke et al., 2006; Lu et al., 2005; Song et al., 2005; 
Williams et al., 2002), 2 studies used both a self-rating and a 
clinician-administered rating scale (Serfaty et al., 2010; 
Rahman et al., 2010a) and 1 study only used a clinician- 
administered scale (Riemersma-van der Lek et al., 2008). Only 
one of the studies had amelioration of depressive symptoms as 
a primary outcome (Rahman et al., 2010a). 

3.1. Effect of melatonin 



respectively. This meta-analysis did not show a significant 
effect of melatonin versus placebo on HAD score (Relative 
effect 0.97, 95% confidence interval (CI) -0.84 to 2.78). 

Another meta-analysis (Figure 3) included 2 other studies 
(Chojnacki et al., 2011; Serfaty et al., 2010) in patients with 
ulcerative colitis and MDD with early-morning awakening 
respectively. In both studies patients had a depression at 
the time of inclusion and depression was measured by Beck 
Depression Inventory (BDI). The closest time-points chosen 
to extract data were 3 months and 5 weeks respectively. 
This metaanalysis did not show a significant effect of 
melatonin versus placebo on BDI (Relative effect -1.09, 
95% confidence interval (CI) -2.60 to 0.42). 

3.3. Risk of bias 

Overall findings are presented in the risk of bias summary 
(Figure 4). Selection bias was low as all studies adequately 
described the method used to generate the random 
sequence. Five studies were unclear with regard to the 



Six studies (Chojnacki et al., 2011; Hurtuk et al., 2011; 
Kripke et al., 2006; Lu et al., 2005; Serfaty et al., 2010; 
Song et al., 2005) showed an improvement in depression 
scores in both the melatonin and placebo groups but there 
were no significant differences between the two groups. 
Two studies did not show an effect of melatonin on 
depression scores and since the results from the placebo 
group were not clearly reported, it is not known whether an 
effect was found in this group (Riemersma-van der Lek 
et al., 2008; Williams et al., 2002). 

One randomized crossover study with 18 non-depressed 
elderly patients evaluating the effect of 5 mg melatonin on 
sleep, and behavioral disorders showed a significant 
improvement (p=0.043) measured by the GDS (mean base- 
line: 7.33, mean melatonin: 5.61, mean placebo: 7.06) in 
the melatonin group compared to baseline and placebo 
after 8 weeks of treatment (Garzon et al., 2009). Another 
randomized crossover study also using 5 mg melatonin for 
4 weeks in 20 patients with Delayed Sleep Phase Syndrome 
showed a significant improvement (p<0.05) in depression 
measured by HDRS-17 (data extracted from figure: mean 
baseline: 13.4, mean melatonin: 6.0, mean placebo: 13.0) 
and CES-D (data extracted from figure: mean baseline: 33.5, 
mean melatonin: 19.0, mean placebo: 25.0) compared to 
placebo and baseline, although this was only in the group 
with depression at inclusion (n=8) (Rahman et al., 2010a). 

3.2. Meta-analyses: (Figures 2 and 3) 

One meta-analysis (Figure 2) included 2 studies; one RCT 
(Song et al., 2005) and one randomized cross-over trial 
(Lu et al., 2005), in patients with irritable bowel syndrome 
who did not have a depression at the time of inclusion. 
Depression was measured by The Hospital Anxiety and 
Depression Scale (HAD). One other study (Williams et al., 
2002) also used HAD but since data were not reported for 
placebo, it was not possible to include this study in the 
meta-analysis. To be able to extract data from the 2 studies 
we chose the closest time-points for measurement of depres- 
sion/depressive symptoms; after 2 and 8 weeks of treatment 
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Figure 4 Risk of bias table. 
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allocation concealment (Chojnacki et al., 2011; Hurtuk 
et al., 2011; Lu et al., 2005; Rahman et al., 2010a; 
Williams et al., 2002). Blinding of participants and person- 
nel was adequately described in all studies, except one 
(Rahman et al., 2010a) and blinding of outcome assessors 
was adequately described in 5 studies (Garzon et al., 2009; 
Kripke et al., 2006; Riemersma-van der Lek et al., 2008; 
Serfaty et al., 2010; Song et al., 2005). Four studies had a 
high risk of attrition bias (Hurtuk et al., 2011; Kripke et al., 
2006; Lu et al., 2005; Williams et al., 2002) with many 
randomized patients who dropped out or were excluded, 
and who were not adequately accounted for. There was a 
high risk of reporting bias as only one study (Kripke et al., 
2006) had an available protocol. The outcomes reported in 
this protocol (NCT00288262) were in complete accordance 
with what was reported in the article, giving this study 
(Kripke et al., 2006) a low risk of reporting bias. Of the five 
cross-over trials (Garzon et al., 2009; Hurtuk et al., 2011; Lu 
et al., 2005; Rahman et al., 2010a; Williams et al., 2002), 
two of them had a high risk of bias with regard to the 
statistics (Garzon et al., 2009; Rahman et al., 2010a). 

3.4. Adverse events 

Three studies (Garzon et al., 2009; Hurtuk et al., 2011; 
Riemersma-van der Lek et al., 2008) did not report any 
adverse events in the melatonin group and 3 studies 
(Rahman et al., 2010a; Song et al., 2005; Williams et al., 
2002) did not report on adverse effects at all. In the 
remaining 4 studies (Chojnacki et al., 2011; Kripke et al., 
2006; Lu et al., 2005; Serfaty et al., 2010) mild daytime 
sleepiness, headaches, poor sleep, vivid dreams and a fuzzy 
feeling were reported as adverse effects in the melatonin 
groups but there was no significant difference compared to 
placebo. 

4. Discussion 

This present systematic review does not support routine 
administration of melatonin for treatment or prophylaxis of 
depression, although two small studies (one prophylactic 
and one treatment) did find an effect compared to placebo 
and no studies found the opposite. The ten included studies 
were rather small with regard to number of included 
patients and heterogeneous with regard to type of patients 
included, melatonin dose, timing and duration of interven- 
tion, outcome measurement instrument and timing of 
measurement. 

It is noteworthy, that six of the included studies showed 
an improvement of depression in both the melatonin and 
placebo groups over the course of time, showing that a 
placebo-response is probably present (Walsh et al., 2002; 
Rutherford and Roose, 2013). The placebo- response in 
antidepressant trials has risen over the last decades, leading 
to a reduction in the medication-placebo differences 
(Rutherford and Roose, 2013; Walsh et al., 2002). Conse- 
quently, it is becoming increasingly difficult to demonstrate 
a significant benefit of a putative antidepressant, such as 
melatonin, over placebo (Rutherford and Roose, 2013). This 
could be an explanation why an overall evident effect of 
melatonin was not found in this review including studies 
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from 2005-2011 and future studies taking the placebo- 
response into consideration are necessary. Furthermore 
our review did not include any studies in patients with 
severe, elongated depression needing hospitalization; this 
population possibly contributing to a larger response rates 
of antidepressants against placebo (Rutherford and Roose, 
2013; Walsh et al., 2002). 

This review solely included studies using validated 
clinician-administered or self-rating questionnaires with 
known psychometric properties and a specific main focus 
on diagnosing and/or measuring depression. This was done 
to improve both the internal and external validity of this 
review. Consequently, it made comparability within the 
review possible and gave the opportunity to complete 
meta-analyses. Finally there is a greater clinical relevance 
of the results by making comparability and generalizability 
of the results an option in respect to other previous or 
future clinical studies on depression and depressive symp- 
toms. However, in the context of the remaining literature 
with studies using other non-specific questionnaires/ rating 
scales to assess mood and depression, melatonin has shown 
a positive effect in some studies (Bellipanni et al., 2001; 
Fava et al., 2012; Jean-Louis et al., 1998; Lewy et al., 1998; 
Lissoni et al., 1995, 2000; Suresh Kumar et al., 2007). 
Whether the positive effect of melatonin found in these 
excluded studies is due to the non-specific measuring 
instrument of mood/depression used or due to co- 
intervention with a possible add-on effect of melatonin in 
some of the studies is not known, but should be an issue for 
future investigations. 

An aspect that could have confounded the results of this 
review is the inclusion of cross-over trials as seasonal 
variation (Kasper et al., 1989) and spontaneous fluctuations, 
especially spontaneous improvement, in mood and depres- 
sion over time are evident (Rutherford et al., 2012; 
Rutherford and Roose, 2013). We chose to include rando- 
mized cross-over trials in our review, as they constitute a 
large group of the available evidence and furthermore, we 
assessed the carry-over effect to be minimal due to the 
short half-life of melatonin (Cardinali et al., 2012). This 
leaves only the aspect of the period effect as a risk of bias, 
which we judged did not have enough effect to exclude the 
evidence from these studies in the review. When including 
both an RCT and a randomized cross-over trial in a metaa- 
nalysis, we also knew it would give rise to a unit-of-analysis 
issue, but since the results from the randomized cross-over 
trial approximate those from the RCT in a comparable 
population (Song et al., 2005), we accepted the conserva- 
tive estimate (Higgins et al., 2011). 

Presently, we do not find the evidence on this topic 
adequate and comprehensive enough and in light of the 
heterogeneity of the included studies and the quality of the 
evidence being low, we believe further studies are war- 
ranted. The first question that arises for these future 
studies is to determine whether a possible effect of 
melatonin is confined to those patients who are depressed 
or if melatonin is also applicable as prophylaxis. One study 
has shown that low endogenous melatonin secretors who do 
not have the normal nocturnal peak in secretion are more 
likely to have depressive symptoms and subsyndromal 
depression (Rahman et al., 2010b), which could support 
the rationale of treatment with melatonin to prevent the 
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development of depression. If melatonin has a treatment 
effect on depression, it must also be evaluated whether 
melatonin only has an effect in those patients who exhibit a 
clear chronobiological disturbance in their depression (Boyce 
and Hopwood, 2013; Malhi and Kuiper, 2013). Furthermore, 
whether melatonin could have a possible synergistic effect 
when given together with other antidepressants (Hirsch- 
Rodriguez et al., 2007), or other treatments such as bright 
light (Riemersma-van der Lek et al., 2008), tamoxifen (Lissoni 
et al., 1995), 5-methoxytryptamine (Lissoni et al., 2000) or 
buspirone (Fava et al., 2012) and thereby be used as an add-on 
must be investigated. In view of the longer half-life and much 
higher potencies of the different melatonin agonists, such as 
ramelteon, future studies are also warranted with melatonin 
in higher doses before the relative effect of melatonin can be 
clarified (Cardinali et al., 2012). As many of the included 
studies were relatively small in size, larger studies should be 
conducted to exclude the possibility of a type II error. 
Furthermore, future studies should be performed as RCTs in 
order to rule out any bias related to the carry-over and period 
effect linked to cross-over trials. 

In summary, this systematic review does not support 
routine administration of melatonin for treatment or pro- 
phylaxis against depression until further data are available. 
Thus, further studies are warranted. 
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